Abstract: Optimizing the energy efficiency (EE) of the MIMO broadcasting channels (BC) is considered in this paper, where a practical power model is taken into account. Although the EE of the MIMO BC is nonconcave, we reformulate it as a quasiconcave function based on the uplink-downlink duality. An energy efficient iterative waterfilling scheme is then proposed based on the block-coordinate ascent algorithm to obtain the optimal transmission policy efficiently, and the scheme is proved to converge. Through simulations, we validate the efficiency of the proposed scheme and discuss the effect of the system parameters on the EE.

Index Terms: Energy efficiency, MIMO broadcasting channels, iterative waterfilling.

I. Introduction 

Wireless communication is entering the green era. This is driven not only by the exponential traffic growth that accompanies the popularity of smart phones, but also by limited energy sources and ever higher energy prices. Energy efficiency (EE), as a result, has become one of the major topics in wireless communications research [1], and many government-funded and industry-funded research projects have started to investigate energy efficient solutions for wireless networks as well as a sustainable future for wireless communications. Meanwhile, multiple input multiple output (MIMO), especially downlink multiuser MIMO (also called MIMO broadcasting channels, BC), has become a key technology in cellular networks due to its significant spectral efficiency (SE) improvement. Therefore, studying the EE of the MIMO BC is a critical issue.

The EE is in general defined as the capacity divided by the power consumption, i.e., the number of bits delivered per unit of energy, measured in bits per Joule. A large body of literature discusses the EE of point to point MIMO channels [2]-[7]. A point to point MIMO channel can always be separated into parallel sub-channels through singular value decomposition (SVD) or after detection. In this case, only the power allocation across the sub-channels needs to be optimized to balance the transmit power against the circuit power, and thus maximize the EE [3]-[5]. As the sub-channels are parallel, the solution is similar to the energy efficient power allocation in OFDM systems [8], [9]. The optimization for point to point MIMO channels is not applicable to the MIMO BC, as the MIMO BC cannot be simply transformed into parallel sub-channels¹. Few works discuss the EE of the MIMO BC. To the best of the authors' knowledge, only [11] and our previous work [12] addressed this topic, but both assumed linear precoding and equal transmit power allocation for simplicity. The assumption of linear precoding keeps both works far from the optimal solution. To optimize the system performance (both SE and EE) of the MIMO BC, the precoding matrices and the power allocation should be decided jointly. For instance, the common way to achieve the maximum sum capacity is to employ dirty paper coding (DPC) and iterative waterfilling [10], [13]-[15]. From the standpoint of EE, the joint precoding and power allocation should also be carefully designed to achieve the optimum, which is the main concern of this paper.

We consider the problem of optimizing the EE with joint precoding and power allocation for the MIMO BC, where a practical power model including signal processing power, circuit power, etc. at the base station (BS) [12], [16] is taken into account. After formulating the maximum EE problem, we find that the energy efficient joint precoding and power allocation design is equivalent to optimizing only the transmit covariance matrices. However, optimizing the transmit covariance matrices is difficult, as the EE function achieved by DPC is nonconcave. Fortunately, we find an efficient iterative solution based on the uplink-downlink duality, and the contributions are summarized as follows.

Contributions: We transform the EE into a quasiconcave function by employing the well-known uplink-downlink duality. The duality transforms the nonconcave MIMO BC capacity expression into that of the dual MIMO multiple access channels (MAC), which is a convex problem. After that, we propose a novel energy efficient iterative waterfilling scheme based on the block-coordinate ascent algorithm to solve the quasiconcave EE optimization problem efficiently. During each iteration, the optimization of the transmit covariance matrices is formulated as a concave fractional program, and solved by relating it to a parametric concave program and then applying the Karush-Kuhn-Tucker optimality conditions.

1 Although after zero-forcing (ZF) precoding, for example, the MIMO BC can be separated into parallel sub-channels, the ZF scheme is far from the optimal solution [10].





Fig. 1. System model of the MIMO BC (left) and its dual MIMO MAC 
(right). 



Interestingly, the solution of each iteration has a waterfilling structure. We prove the convergence of the proposed scheme and validate it through simulations. Finally, the effect of the system parameters on the EE is discussed.

II. System Model 

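The system consists of a single BS with $M$ antennas and $K$ users, each with $N$ antennas², as shown in Fig. 1. The downlink channel can be denoted as

$$\mathbf{y}_i = \mathbf{H}_i \mathbf{x} + \mathbf{n}_i, \quad i = 1, \ldots, K, \tag{1}$$

and the dual uplink channel is denoted as

$$\mathbf{y}_{\mathrm{MAC}} = \sum_{i=1}^{K} \mathbf{H}_i^H \mathbf{x}_i + \mathbf{n}, \tag{2}$$

where $\mathbf{H}_i \in \mathbb{C}^{N \times M}$ is the channel matrix of the $i$th user, $\mathbf{x} \in \mathbb{C}^{M \times 1}$ is the transmitted signal on the downlink, $\mathbf{x}_i \in \mathbb{C}^{N \times 1}$ is the transmitted signal of the $i$th user on the uplink, and $\mathbf{n}_i \in \mathbb{C}^{N \times 1}$ and $\mathbf{n} \in \mathbb{C}^{M \times 1}$ are independent Gaussian noise vectors with each entry distributed as $\mathcal{CN}(0, \sigma^2)$. Frequency flat fading channels with bandwidth $W$ are considered, and the channel state information (CSI) is assumed to be perfectly known at the transmitter and receivers.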

Regarding the power model, as BSs account for the main part of the power consumption in cellular networks, the power consumed by the users is omitted. The BS power model is motivated by [12], [16]. As the power radiated to the environment for signal transmission is only a portion of the total power consumption [16], the practical circuit power, signal processing power, cooling loss, etc. at the BS should be taken into account. Without loss of generality, given the total BS antenna number $M$ and the total transmit power $P$, the total power consumption of a BS can be denoted as

$$P_{\mathrm{total}} = \frac{P}{\eta} + M P_{\mathrm{dyn}} + P_{\mathrm{sta}}, \tag{3}$$



2 The results here can be extended to the general case with a different number of antennas at each user. Moreover, the results are also applicable to the multi-cell scenario with BS cooperation.



where $\eta$ denotes the power amplifier (PA) efficiency; $M P_{\mathrm{dyn}}$ denotes the dynamic power consumption proportional to the number of radio frequency (RF) chains, e.g., the circuit power of the RF chains, which is always proportional to $M$; and $P_{\mathrm{sta}}$ accounts for the static power independent of both $M$ and $P$, which includes the power consumption of the baseband processing, battery unit, etc.
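As a quick numerical illustration of (3), the following minimal Python sketch evaluates the total BS power. The function name `total_power` and its argument names are introduced here for illustration only; the default values are the ones listed in TABLE III.

```python
def total_power(p_tx, m_antennas, eta=0.38, p_dyn=83.0, p_sta=45.5):
    """Total BS power in watts following (3): P/eta + M*P_dyn + P_sta.

    p_tx       -- radiated transmit power P in watts
    m_antennas -- number of BS antennas M (one RF chain per antenna)
    eta        -- power amplifier efficiency
    p_dyn      -- dynamic power per RF chain in watts
    p_sta      -- static power in watts
    """
    if p_tx < 0 or m_antennas < 1:
        raise ValueError("p_tx must be non-negative and m_antennas at least 1")
    return p_tx / eta + m_antennas * p_dyn + p_sta


if __name__ == "__main__":
    # The circuit-related terms grow linearly with M even for a fixed P.
    for m in (1, 2, 4, 8):
        print(m, total_power(10.0, m))
```

With these values the static and dynamic terms dominate for moderate transmit power, which is why the number of active antennas matters for the EE.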

III. Achievable Sum Capacity and Energy 
Efficiency 

The sum capacity of the MIMO BC is achieved by DPC, 
which can be denoted as follows [ 1 1 given a total transmit 
power P. 



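$$\begin{aligned}
C_{\mathrm{BC}}(\mathbf{H}_1, \ldots, \mathbf{H}_K, P) = \max_{\{\boldsymbol{\Sigma}_i\}_{i=1}^{K}:\ \boldsymbol{\Sigma}_i \succeq 0,\ \sum_{i=1}^{K} \mathrm{Tr}(\boldsymbol{\Sigma}_i) \le P}\ & W \log \left| \mathbf{I} + \frac{1}{\sigma^2} \mathbf{H}_1 \boldsymbol{\Sigma}_1 \mathbf{H}_1^H \right| \\
& + W \log \frac{\left| \mathbf{I} + \frac{1}{\sigma^2} \mathbf{H}_2 (\boldsymbol{\Sigma}_1 + \boldsymbol{\Sigma}_2) \mathbf{H}_2^H \right|}{\left| \mathbf{I} + \frac{1}{\sigma^2} \mathbf{H}_2 \boldsymbol{\Sigma}_1 \mathbf{H}_2^H \right|} + \cdots \\
& + W \log \frac{\left| \mathbf{I} + \frac{1}{\sigma^2} \mathbf{H}_K (\boldsymbol{\Sigma}_1 + \cdots + \boldsymbol{\Sigma}_K) \mathbf{H}_K^H \right|}{\left| \mathbf{I} + \frac{1}{\sigma^2} \mathbf{H}_K (\boldsymbol{\Sigma}_1 + \cdots + \boldsymbol{\Sigma}_{K-1}) \mathbf{H}_K^H \right|},
\end{aligned} \tag{4}$$

where the optimization is performed to choose the optimal downlink transmit covariance matrices $\boldsymbol{\Sigma}_i \in \mathbb{C}^{M \times M}$, $i = 1, \ldots, K$.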

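The EE is defined as the achievable sum capacity of the MIMO BC divided by the total power consumption at the BS, which can be denoted as

$$\xi_{\mathrm{BC}}(\mathbf{H}_1, \ldots, \mathbf{H}_K, P) = \frac{C_{\mathrm{BC}}(\mathbf{H}_1, \ldots, \mathbf{H}_K, P)}{\frac{P}{\eta} + M P_{\mathrm{dyn}} + P_{\mathrm{sta}}} \tag{5}$$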



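under a fixed transmit power $P$. Based on (5), the optimal EE for the MIMO BC can be obtained through optimizing $P$³, which is denoted as

$$\xi_{\mathrm{BC}}(\mathbf{H}_1, \ldots, \mathbf{H}_K) = \max_{P:\ P \ge 0} \frac{C_{\mathrm{BC}}(\mathbf{H}_1, \ldots, \mathbf{H}_K, P)}{\frac{P}{\eta} + M P_{\mathrm{dyn}} + P_{\mathrm{sta}}}. \tag{6}$$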



In the above optimization problem, the transmit power level $P$ and the transmit covariance matrices $\boldsymbol{\Sigma}_i \in \mathbb{C}^{M \times M}$, $i = 1, \ldots, K$ need to be jointly optimized. Note that although no maximum transmit power constraint is considered in (6), the solution can be easily extended to the constrained case based on JU, [18]. We omit the extension here due to the page limit.

As (4) is nonconcave, optimizing (6) is nontrivial. Fortunately, motivated by [19], [20], we find the following property. If the numerator (sum capacity) can be transformed into a concave function, the EE can be formulated as a quasiconcave function, because the denominator (total power consumption) is affine. Based on this observation, we try to transform the sum capacity into a concave function.
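The step from a concave numerator to a quasiconcave ratio can be made explicit through superlevel sets. The sketch below uses generic symbols that are not part of the paper's notation: $f$ stands for a non-negative concave numerator, $h$ for a positive affine denominator, and $\alpha$ for an arbitrary level.

```latex
\begin{align*}
\mathcal{S}_\alpha
  &= \Big\{ \mathbf{x} : \frac{f(\mathbf{x})}{h(\mathbf{x})} \ge \alpha \Big\}
   = \big\{ \mathbf{x} : f(\mathbf{x}) - \alpha\, h(\mathbf{x}) \ge 0 \big\},
  && h(\mathbf{x}) > 0, \\
\alpha \ge 0 &: \quad f - \alpha h \ \text{is concave, so } \mathcal{S}_\alpha \text{ is convex}, \\
\alpha < 0 &: \quad \frac{f(\mathbf{x})}{h(\mathbf{x})} \ge 0 > \alpha
  \ \text{for all feasible } \mathbf{x}, \text{ so } \mathcal{S}_\alpha
  \text{ is the whole (convex) domain}.
\end{align*}
```

Every superlevel set is therefore convex, which is the definition of quasiconcavity.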

Applying the uplink-downlink duality [13], the MIMO BC sum capacity (4) is equal to the concave sum capacity of the
3 Indeed, the EE of the MIMO BC is highly affected by the transmit power $P$ and the transmit antenna number $M$, and jointly optimizing $P$ and $M$ is required to maximize the EE. We only consider the optimization of $P$ in this paper; optimizing $M$ to further improve the EE through active transmit antenna selection can be found in [17].



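MIMO MAC (2) with a sum transmit power constraint, which can be denoted as

$$C_{\mathrm{MAC}}(\mathbf{H}_1^H, \ldots, \mathbf{H}_K^H, P) = \max_{\{\mathbf{Q}_i\}_{i=1}^{K}:\ \mathbf{Q}_i \succeq 0,\ \sum_{i=1}^{K} \mathrm{Tr}(\mathbf{Q}_i) \le P} W \log \left| \mathbf{I} + \frac{1}{\sigma^2} \sum_{i=1}^{K} \mathbf{H}_i^H \mathbf{Q}_i \mathbf{H}_i \right|, \tag{7}$$

where the uplink transmit covariance matrices $\mathbf{Q}_i \in \mathbb{C}^{N \times N}$, $i = 1, \ldots, K$ need to be optimized. Thus, the dual MAC optimal EE should be rewritten as

$$\xi_{\mathrm{MAC}}(\mathbf{H}_1^H, \ldots, \mathbf{H}_K^H) = \max_{P:\ P \ge 0} \frac{C_{\mathrm{MAC}}(\mathbf{H}_1^H, \ldots, \mathbf{H}_K^H, P)}{\frac{P}{\eta} + M P_{\mathrm{dyn}} + P_{\mathrm{sta}}}. \tag{8}$$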

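According to the duality and the mapping between $\boldsymbol{\Sigma}_i$ and $\mathbf{Q}_i$ [13], the optimal $\boldsymbol{\Sigma}_i$, $i = 1, \ldots, K$ and $P$ can be obtained if we can get the optimal $\mathbf{Q}_i$, $i = 1, \ldots, K$ and $P$ in (8). Furthermore, since the maximum power constraint is not considered and $\sum_{i=1}^{K} \mathrm{Tr}(\mathbf{Q}_i) = P$ is always required for optimizing (7) [14], (8) can be simplified and rewritten as

$$\xi_{\mathrm{MAC}}(\mathbf{H}_1^H, \ldots, \mathbf{H}_K^H) = \max_{\mathbf{Q}_i \succeq 0,\ i = 1, \ldots, K} \frac{W \log \left| \mathbf{I} + \frac{1}{\sigma^2} \sum_{i=1}^{K} \mathbf{H}_i^H \mathbf{Q}_i \mathbf{H}_i \right|}{\frac{1}{\eta} \sum_{i=1}^{K} \mathrm{Tr}(\mathbf{Q}_i) + M P_{\mathrm{dyn}} + P_{\mathrm{sta}}}. \tag{9}$$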



Finally, the optimization of (6) is transformed into optimizing (9). If we can obtain the optimal $\mathbf{Q}_i$, $i = 1, \ldots, K$ for (9), the optimal $P$ can be decided correspondingly based on $\sum_{i=1}^{K} \mathrm{Tr}(\mathbf{Q}_i) = P$. That is to say, designing the joint precoding and power allocation to maximize the EE is equivalent to optimizing the transmit covariance matrices. Once the transmit covariance matrices are decided, the transmit power level can be determined correspondingly.
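The EE of the dual MAC for a given set of uplink covariance matrices, i.e., the ratio of sum rate to total power, can be evaluated numerically with a few lines of NumPy. This is a minimal sketch: the function name `mac_energy_efficiency` and its argument names are hypothetical, the logarithm is taken to base 2 so that the result is in bits per Joule, and the default parameter values are only illustrative.

```python
import numpy as np


def mac_energy_efficiency(H, Q, W=5e6, sigma2=1.0, eta=0.38,
                          p_dyn=83.0, p_sta=45.5):
    """Sum rate over total power for the dual MAC, in bits per Joule.

    H -- list of K downlink channel matrices, each N x M
         (the uplink channel of user i is H[i]^H)
    Q -- list of K uplink covariance matrices, each N x N, Hermitian PSD
    """
    M = H[0].shape[1]
    A = np.eye(M, dtype=complex)
    for Hi, Qi in zip(H, Q):
        A = A + Hi.conj().T @ Qi @ Hi / sigma2
    # slogdet is numerically safer than log(det(A)) for large matrices
    _, logabsdet = np.linalg.slogdet(A)
    rate = W * logabsdet / np.log(2.0)
    # The transmit power is implied by the covariances: P = sum_i Tr(Q_i)
    p_tx = sum(np.trace(Qi).real for Qi in Q)
    return rate / (p_tx / eta + M * p_dyn + p_sta)
```

Because the transmit power is computed from the traces, no separate variable for $P$ is needed, which mirrors the observation that only the covariance matrices have to be optimized.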

Let us look at (9) again. Since the numerator is concave and the denominator is affine, (9) is a quasiconcave optimization problem, which can be solved through the bisection method or interior-point methods. However, these numerical methods would still be too complex when the number of users becomes large. Motivated by [14], [15], an energy efficient iterative waterfilling scheme is proposed in the next section to solve it more efficiently.

IV. Energy Efficient Iterative Optimization 
A. Motivation 

As the EE is distinct from the capacity, the spectral efficient iterative waterfilling [14] is no longer applicable to the EE. Nevertheless, we notice that the basic idea of the spectral efficient iterative algorithms is based on the block-coordinate ascent algorithm [21, Sec. 2.7]. That is to say, if we can write the EE in a structure suited to the block-coordinate ascent algorithm and then prove that it satisfies the conditions of [21, Sec. 2.7], we can obtain an iterative solution of the problem (9).



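For ease of description, we first define the following function $g(\cdot)$:

$$g(\mathbf{Q}_1, \ldots, \mathbf{Q}_K) = \frac{W \log \left| \mathbf{I} + \frac{1}{\sigma^2} \sum_{i=1}^{K} \mathbf{H}_i^H \mathbf{Q}_i \mathbf{H}_i \right|}{\frac{1}{\eta} \sum_{i=1}^{K} \mathrm{Tr}(\mathbf{Q}_i) + M P_{\mathrm{dyn}} + P_{\mathrm{sta}}}. \tag{10}$$

For the block-coordinate ascent algorithm, given the current iterate $\mathbf{Q}^{(k)} = (\mathbf{Q}_1^{(k)}, \ldots, \mathbf{Q}_K^{(k)})$, the next iterate $\mathbf{Q}^{(k+1)} = (\mathbf{Q}_1^{(k+1)}, \ldots, \mathbf{Q}_K^{(k+1)})$ can be generated as

$$\mathbf{Q}_i^{(k+1)} = \arg\max_{\mathbf{Q}_i:\ \mathbf{Q}_i \succeq 0} g\left(\mathbf{Q}_1^{(k+1)}, \ldots, \mathbf{Q}_{i-1}^{(k+1)}, \mathbf{Q}_i, \mathbf{Q}_{i+1}^{(k)}, \ldots, \mathbf{Q}_K^{(k)}\right). \tag{11}$$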

However, to apply the iterative algorithm efficiently, two conditions need to be satisfied. First, the solution of (11) should be uniquely attained [21, Proposition 2.7.1]. Second, the solution should be simple and easy to compute.

Fortunately, both conditions are fulfilled, and the solution can be obtained following an energy efficient waterfilling structure, as shown in the next subsection.

B. Energy Efficient Waterfilling 

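Based on [14], [13], it holds that

$$\begin{aligned}
\log \left| \mathbf{I} + \frac{1}{\sigma^2} \sum_{j=1}^{K} \mathbf{H}_j^H \mathbf{Q}_j \mathbf{H}_j \right| = {} & \log \left| \mathbf{I} + \frac{1}{\sigma^2} \sum_{j \ne i} \mathbf{H}_j^H \mathbf{Q}_j \mathbf{H}_j \right| \\
& + \log \left| \mathbf{I} + \frac{1}{\sigma^2} \Big( \mathbf{I} + \frac{1}{\sigma^2} \sum_{j \ne i} \mathbf{H}_j^H \mathbf{Q}_j \mathbf{H}_j \Big)^{-1/2} \mathbf{H}_i^H \mathbf{Q}_i \mathbf{H}_i \Big( \mathbf{I} + \frac{1}{\sigma^2} \sum_{j \ne i} \mathbf{H}_j^H \mathbf{Q}_j \mathbf{H}_j \Big)^{-1/2} \right|.
\end{aligned} \tag{12}$$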
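By denoting

$$\mathbf{Z}_i = \mathbf{I} + \frac{1}{\sigma^2} \sum_{j \ne i} \mathbf{H}_j^H \mathbf{Q}_j \mathbf{H}_j, \quad \mathbf{G}_i = \frac{1}{\sigma} \mathbf{H}_i \mathbf{Z}_i^{-1/2}, \quad a_i = \frac{1}{\eta} \sum_{j \ne i} \mathrm{Tr}(\mathbf{Q}_j) + M P_{\mathrm{dyn}} + P_{\mathrm{sta}}, \quad b_i = W \log |\mathbf{Z}_i|$$

and substituting (12) into (10), we have that

$$g(\mathbf{Q}_1, \ldots, \mathbf{Q}_K) = \frac{b_i + W \log \left| \mathbf{I} + \mathbf{G}_i^H \mathbf{Q}_i \mathbf{G}_i \right|}{a_i + \frac{1}{\eta} \mathrm{Tr}(\mathbf{Q}_i)}. \tag{13}$$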
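The determinant split that isolates the contribution of user $i$ is easy to check numerically. The following sketch does so with NumPy; the helper names (`inv_sqrt_psd`, `log2det`, `split_terms`) are hypothetical, and the effective channel is assumed to be the $i$th channel multiplied by the inverse square root of the interference-plus-identity matrix and scaled by $1/\sigma$.

```python
import numpy as np


def inv_sqrt_psd(Z):
    """Inverse square root of a Hermitian positive definite matrix."""
    w, V = np.linalg.eigh(Z)
    return (V / np.sqrt(w)) @ V.conj().T


def log2det(A):
    """log2 |A| computed through slogdet for numerical stability."""
    return np.linalg.slogdet(A)[1] / np.log(2.0)


def split_terms(H, Q, i, sigma2=1.0):
    """Return (lhs, rhs): joint log-determinant and its two-term split."""
    M = H[0].shape[1]
    Z = np.eye(M, dtype=complex)
    for j, (Hj, Qj) in enumerate(zip(H, Q)):
        if j != i:
            Z = Z + Hj.conj().T @ Qj @ Hj / sigma2
    # Effective channel of user i after whitening the other users' signals
    G = H[i] @ inv_sqrt_psd(Z) / np.sqrt(sigma2)
    lhs = log2det(Z + H[i].conj().T @ Q[i] @ H[i] / sigma2)
    rhs = log2det(Z) + log2det(np.eye(M) + G.conj().T @ Q[i] @ G)
    return lhs, rhs
```

For any positive semidefinite covariance matrices the two returned values agree up to rounding, so only the second term of the split depends on the covariance of user $i$.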



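Therefore, we can redefine the problem (11) by removing the iteration number as

$$\underset{\mathbf{Q}_i:\ \mathbf{Q}_i \succeq 0}{\text{maximize}}\ \ g(\mathbf{Q}_1, \ldots, \mathbf{Q}_{i-1}, \mathbf{Q}_i, \mathbf{Q}_{i+1}, \ldots, \mathbf{Q}_K) = \frac{b_i + W \log \left| \mathbf{I} + \mathbf{G}_i^H \mathbf{Q}_i \mathbf{G}_i \right|}{a_i + \frac{1}{\eta} \mathrm{Tr}(\mathbf{Q}_i)} \tag{14}$$

by treating $\mathbf{Q}_1, \ldots, \mathbf{Q}_{i-1}, \mathbf{Q}_{i+1}, \ldots, \mathbf{Q}_K$ as constant. Based on Section IV-A, we need to solve the above problem and prove that the solution is unique.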

Since the numerator and denominator in (13) are concave and affine respectively, (14) is a concave fractional program [20]. Defining a non-negative parameter $\lambda$, (14) is related to the following function, which is concave in $\mathbf{Q}_i$ and separates the numerator and denominator with the help of $\lambda$:

$$F(\mathbf{Q}_i, \lambda) = b_i + W \log \left| \mathbf{I} + \mathbf{G}_i^H \mathbf{Q}_i \mathbf{G}_i \right| - \lambda \left( a_i + \frac{1}{\eta} \mathrm{Tr}(\mathbf{Q}_i) \right). \tag{15}$$

We then define a convex optimization problem as

$$Y(\lambda) = \max_{\mathbf{Q}_i \succeq 0} F(\mathbf{Q}_i, \lambda). \tag{16}$$

We will solve (16), and then the solution of (14) can be obtained correspondingly based on the following theorem.

Theorem 1: The optimum feasible transmit covariance matrix $\mathbf{Q}_i^*$ achieves the maximum value of (14) if and only if

$$Y(\lambda^*) = F(\mathbf{Q}_i^*, \lambda^*) = \max \{ F(\mathbf{Q}_i, \lambda^*) \mid \mathbf{Q}_i \succeq 0 \} = 0.$$

Proof: See Appendix A.

Theorem 1 gives us insight into how to solve (14). We should first optimize (16) under a given $\lambda$ and then solve the equation $Y(\lambda) = 0$ to get the optimal $\lambda$.

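To solve (16), we can first denote

$$\mathbf{G}_i^H \mathbf{G}_i = \mathbf{U} \mathbf{D}_i \mathbf{U}^H \tag{17}$$

based on the eigenvalue decomposition, where $\mathbf{D}_i \in \mathbb{C}^{M \times M}$ is diagonal with nonnegative entries and $\mathbf{U} \in \mathbb{C}^{M \times M}$ is unitary. Without loss of generality, we assume that $\mathbf{D}_i$ has $L$ non-zero diagonal entries ($1 \le L \le M$), which means $[\mathbf{D}_i]_{kk} > 0$ for $k = 1, \ldots, L$ and $[\mathbf{D}_i]_{kk} = 0$ for $k = L + 1, \ldots, M$.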

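We then have the following equation based on $|\mathbf{I} + \mathbf{A}\mathbf{B}| = |\mathbf{I} + \mathbf{B}\mathbf{A}|$ QI:

$$\begin{aligned}
\log \left| \mathbf{I} + \mathbf{G}_i^H \mathbf{Q}_i \mathbf{G}_i \right| &= \log \left| \mathbf{I} + \mathbf{Q}_i \mathbf{G}_i^H \mathbf{G}_i \right| \\
&= \log \left| \mathbf{I} + \mathbf{Q}_i \mathbf{U} \mathbf{D}_i \mathbf{U}^H \right| = \log \left| \mathbf{I} + \mathbf{U}^H \mathbf{Q}_i \mathbf{U} \mathbf{D}_i \right|.
\end{aligned} \tag{18}$$

Define $\mathbf{S}_i = \mathbf{U}^H \mathbf{Q}_i \mathbf{U}$. As $\mathbf{U}$ is unitary, we have that $\mathrm{Tr}(\mathbf{S}_i) = \mathrm{Tr}(\mathbf{Q}_i)$. Thus, (15) can be rewritten as

$$G(\mathbf{S}_i, \lambda) = b_i + W \log \left| \mathbf{I} + \mathbf{S}_i \mathbf{D}_i \right| - \lambda \left( a_i + \frac{1}{\eta} \mathrm{Tr}(\mathbf{S}_i) \right). \tag{19}$$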

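As each $\mathbf{S}_i$ corresponds to a $\mathbf{Q}_i$ via the invertible mapping $\mathbf{S}_i = \mathbf{U}^H \mathbf{Q}_i \mathbf{U}$, solving (16) is equivalent to solving the following convex optimization problem:

$$Y(\lambda) = \max_{\mathbf{S}_i \succeq 0} G(\mathbf{S}_i, \lambda). \tag{20}$$

It is proved in Appendix B that the optimal $\mathbf{S}_i^*$ solving (20) is diagonal with $[\mathbf{S}_i^*]_{kk} \ge 0$ for $k = 1, \ldots, L$ and $[\mathbf{S}_i^*]_{kk} = 0$ for $k = L + 1, \ldots, M$. Thus, $G(\mathbf{S}_i, \lambda)$ with diagonal $\mathbf{S}_i$ is

$$G(\mathbf{S}_i, \lambda) = b_i + W \sum_{k=1}^{L} \log \left( 1 + [\mathbf{S}_i]_{kk} [\mathbf{D}_i]_{kk} \right) - \lambda \left( a_i + \frac{1}{\eta} \sum_{k=1}^{L} [\mathbf{S}_i]_{kk} \right). \tag{21}$$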



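As (21) is concave in $\mathbf{S}_i$, the problem (20) can be solved for a given $\lambda$ by solving the Karush-Kuhn-Tucker optimality conditions, and the solution can be denoted as

$$[\mathbf{S}_{i,\lambda}^*]_{kk} = \left[ \frac{\eta W}{\ln(2) \lambda} - \frac{1}{[\mathbf{D}_i]_{kk}} \right]^+, \quad k = 1, \ldots, L, \tag{22}$$

where $[x]^+ = \max(x, 0)$. Then the water level $\lambda^*$ can be decided by setting $Y(\lambda^*) = 0$ based on Theorem 1 as

$$b_i + W \sum_{k=1}^{L} \log \left( 1 + [\mathbf{S}_{i,\lambda^*}^*]_{kk} [\mathbf{D}_i]_{kk} \right) - \lambda^* \left( a_i + \frac{1}{\eta} \sum_{k=1}^{L} [\mathbf{S}_{i,\lambda^*}^*]_{kk} \right) = 0. \tag{23}$$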



As $Y(\lambda)$ is strictly decreasing, with $Y(0) = \infty$ and $Y(\infty) = -\infty$ (see the detailed proof in Appendix C), we can solve (23) efficiently based on the bisection method.
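For completeness, the waterfilling form of the per-eigenmode powers can be recovered from the Karush-Kuhn-Tucker conditions of the diagonal problem. In the sketch below, $s_k = [\mathbf{S}_i]_{kk}$, $d_k = [\mathbf{D}_i]_{kk}$ and the multipliers $\mu_k$ are shorthand introduced here, $\log$ is taken to base 2 (consistent with the $\ln(2)$ factor in the solution), and the constant terms are written as $a_i$ and $b_i$.

```latex
\begin{align*}
\mathcal{L} &= b_i + W \sum_{k=1}^{L} \log_2 (1 + s_k d_k)
  - \lambda \Big( a_i + \frac{1}{\eta} \sum_{k=1}^{L} s_k \Big)
  + \sum_{k=1}^{L} \mu_k s_k, \\
\frac{\partial \mathcal{L}}{\partial s_k}
  &= \frac{W d_k}{\ln(2)\,(1 + s_k d_k)} - \frac{\lambda}{\eta} + \mu_k = 0,
  \qquad \mu_k \ge 0, \quad \mu_k s_k = 0, \quad s_k \ge 0, \\
s_k > 0 \ (\mu_k = 0) &\;\Rightarrow\;
  s_k = \frac{\eta W}{\ln(2)\,\lambda} - \frac{1}{d_k}, \\
\frac{\eta W}{\ln(2)\,\lambda} \le \frac{1}{d_k} &\;\Rightarrow\; s_k = 0 .
\end{align*}
```

Combining the two cases gives the $[\cdot]^+$ expression, with $\eta W / (\ln(2) \lambda)$ playing the role of the water level.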

Based on (23) and (22), the optimal $\mathbf{S}_{i,\lambda^*}^*$ is derived. Based on the mapping between $\mathbf{S}_i$ and $\mathbf{Q}_i$, the optimal solution of (14) can finally be derived as

$$\mathbf{Q}_i^* = \mathbf{U} \mathbf{S}_{i,\lambda^*}^* \mathbf{U}^H. \tag{24}$$



To prove that the solution of (14) is unique, we only need to prove that $\lambda^*$ is unique. We give the following theorem, and the proof is given in Appendix C.

Theorem 2: The derived water level $\lambda^*$ in (23) is unique and globally optimal.

To make the description clearer, we summarize the energy efficient waterfilling algorithm for optimizing (13) in TABLE I.

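TABLE I
Energy Efficient Waterfilling Algorithm

1) Calculate $\mathbf{Z}_i = \mathbf{I} + \frac{1}{\sigma^2} \sum_{j \ne i} \mathbf{H}_j^H \mathbf{Q}_j \mathbf{H}_j$, $\mathbf{G}_i = \frac{1}{\sigma} \mathbf{H}_i \mathbf{Z}_i^{-1/2}$, $a_i = \frac{1}{\eta} \sum_{j \ne i} \mathrm{Tr}(\mathbf{Q}_j) + M P_{\mathrm{dyn}} + P_{\mathrm{sta}}$ and $b_i = W \log |\mathbf{Z}_i|$;
2) Define the related parametric convex program in (15) and (16);
3) Transform the parametric convex program into the diagonal forms (19) and (20) by performing the eigenvalue decomposition in (17);
4) Solve (20) by solving the Karush-Kuhn-Tucker optimality conditions and obtain the solution $\mathbf{S}_{i,\lambda}^*$ in (22);
5) Calculate the energy efficient water level $\lambda^*$ based on (23) and determine the optimal $\mathbf{S}_{i,\lambda^*}^*$;
6) Obtain $\mathbf{Q}_i^*$ based on the mapping (24) finally.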
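Once the eigenvalues are available, the waterfilling step reduces to a one-dimensional root search for the water level. The following is a minimal sketch in plain Python, assuming that the non-zero eigenvalues $[\mathbf{D}_i]_{kk}$ are passed in as the list `d` and the constants $a_i$, $b_i$ as `a` and `b`. The function names are hypothetical, and the bracketing strategy (doubling and halving starting from $\lambda = 1$) is one possible choice rather than something the text prescribes.

```python
import math


def waterfill(d, lam, W, eta):
    """Per-eigenmode powers for a given lam > 0 (waterfilling form)."""
    level = eta * W / (math.log(2.0) * lam)
    return [max(level - 1.0 / dk, 0.0) for dk in d]


def y_value(d, lam, a, b, W, eta):
    """Numerator minus lam times denominator at the waterfilling point."""
    s = waterfill(d, lam, W, eta)
    rate = b + W * sum(math.log2(1.0 + sk * dk) for sk, dk in zip(s, d))
    return rate - lam * (a + sum(s) / eta)


def ee_waterfilling(d, a, b, W, eta, rel_tol=1e-12, max_iter=200):
    """Find lam with y_value(lam) = 0 by bisection; return (lam, powers).

    Requires a > 0, b >= 0 and every entry of d strictly positive.
    """
    lo = hi = 1.0
    while y_value(d, hi, a, b, W, eta) > 0.0:   # grow until the sign flips
        hi *= 2.0
    while y_value(d, lo, a, b, W, eta) <= 0.0:  # shrink until positive
        lo *= 0.5
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if y_value(d, mid, a, b, W, eta) > 0.0:
            lo = mid
        else:
            hi = mid
        if hi - lo <= rel_tol * hi:
            break
    lam = 0.5 * (lo + hi)
    return lam, waterfill(d, lam, W, eta)
```

At the root, the returned `lam` equals the value of the ratio being maximized, so it can be read directly as the energy efficiency achieved by the block update.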



C. Iterative Algorithm 

Based on the derivation in Section IV-B and the block-coordinate ascent algorithm, the energy efficient iterative waterfilling scheme can be derived as shown in TABLE II, and the proof of convergence is given as follows.

Proof of convergence: Firstly, during each step, the energy efficient waterfilling achieves a global maximization of (14) while treating the other users' transmit covariance matrices as constant, so the EE is non-decreasing with each step. As the EE is bounded, the EE converges to a limit.

Secondly, according to Theorem 1 and Theorem 2, the solution of each step is unique. Based on [21, Sec. 2.7], the set of $\mathbf{Q}_1, \ldots, \mathbf{Q}_K$ also converges to a limit. □

TABLE II
Energy Efficient Iterative Waterfilling Scheme

Initialization: Set $\mathbf{Q}_i = \mathbf{0}$, $i = 1, \ldots, K$.
Repeat:
  For i = 1 : K
    1) Calculate $\mathbf{Q}_i^*$ based on the energy efficient waterfilling algorithm in TABLE I;
    2) Refresh $\mathbf{Q}_i$ as $\mathbf{Q}_i^*$;
  End
Until the EE converges.

Fig. 2. EE convergence behavior of the proposed scheme (energy efficiency versus iteration).

Note that as the proof does not depend on the starting point, we can start the algorithm from any starting values of $\mathbf{Q}_1, \ldots, \mathbf{Q}_K$. To show the efficiency of the proposed scheme, we give the simulation results in the next section.
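The complete scheme can be sketched compactly with NumPy. This is an illustrative implementation under stated assumptions, not the authors' code: all function names are hypothetical, the logarithm is taken to base 2, a fixed number of sweeps replaces the convergence test, and the sketch eigen-decomposes the $N \times N$ Gram matrix $\mathbf{G}_i \mathbf{G}_i^H$, which shares its non-zero eigenvalues with $\mathbf{G}_i^H \mathbf{G}_i$ and directly yields an $N \times N$ covariance update.

```python
import numpy as np

LN2 = np.log(2.0)


def herm_inv_sqrt(Z):
    """Inverse square root of a Hermitian positive definite matrix."""
    w, V = np.linalg.eigh(Z)
    return (V / np.sqrt(w)) @ V.conj().T


def ee_waterfill(d, a, b, W, eta):
    """Bisection on lam so that numerator - lam * denominator = 0."""
    def powers(lam):
        return np.maximum(eta * W / (LN2 * lam) - 1.0 / d, 0.0)

    def y(lam):
        s = powers(lam)
        return b + W * np.sum(np.log2(1.0 + s * d)) - lam * (a + s.sum() / eta)

    lo = hi = 1.0
    while y(hi) > 0.0:
        hi *= 2.0
    while y(lo) <= 0.0:
        lo *= 0.5
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if y(mid) > 0.0 else (lo, mid)
    return powers(0.5 * (lo + hi))


def energy_efficiency(H, Q, W, sigma2, eta, p_dyn, p_sta):
    """Sum rate over total power of the dual MAC, in bits per Joule."""
    M = H[0].shape[1]
    A = np.eye(M, dtype=complex)
    A = A + sum(Hi.conj().T @ Qi @ Hi for Hi, Qi in zip(H, Q)) / sigma2
    rate = W * np.linalg.slogdet(A)[1] / LN2
    p_tx = sum(np.trace(Qi).real for Qi in Q)
    return rate / (p_tx / eta + M * p_dyn + p_sta)


def iterative_ee_waterfilling(H, W, sigma2, eta, p_dyn, p_sta, sweeps=20):
    """Block-coordinate ascent over the users; returns (Q, EE per sweep)."""
    K, (N, M) = len(H), H[0].shape
    Q = [np.zeros((N, N), dtype=complex) for _ in range(K)]
    history = []
    for _ in range(sweeps):
        for i in range(K):
            Z = np.eye(M, dtype=complex)
            a = M * p_dyn + p_sta
            for j in range(K):
                if j != i:
                    Z = Z + H[j].conj().T @ Q[j] @ H[j] / sigma2
                    a += np.trace(Q[j]).real / eta
            b = W * np.linalg.slogdet(Z)[1] / LN2
            G = H[i] @ herm_inv_sqrt(Z) / np.sqrt(sigma2)
            d, U = np.linalg.eigh(G @ G.conj().T)   # N x N Gram matrix
            keep = d > 1e-12 * max(d.max(), 0.0)    # non-zero eigenvalues only
            s = np.zeros(N)
            if keep.any():
                s[keep] = ee_waterfill(d[keep], a, b, W, eta)
            Q[i] = (U * s) @ U.conj().T             # U diag(s) U^H
        history.append(energy_efficiency(H, Q, W, sigma2, eta, p_dyn, p_sta))
    return Q, history
```

Since every block update is an exact maximization over one user's covariance, the recorded `history` should be non-decreasing up to rounding, which gives a simple sanity check on an implementation.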

V. Simulation Results 

In the simulation, large scale pathloss and small scale Rayleigh fading are considered. The parameters are set in TABLE III based on [12], and all users are at the same distance from the BS. Fig. 2 shows the convergence behavior of the proposed scheme with $d = 1$ km, $M = 4$, $N = 4$ and $K = 10$. We can see that the proposed iterative scheme converges very fast: it achieves the optimal EE within about five iterations.
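The large scale part of the channel can be turned into a linear gain-to-noise ratio as sketched below. The pathloss formula is the one listed in TABLE III; the function names are hypothetical, and the noise power is left as an argument because the value used in the usage line (-110 dBm) is an assumption about the corresponding table entry.

```python
import math


def pathloss_db(d_km):
    """Large scale pathloss in dB: 128.1 + 37.6*log10(d), d in kilometers."""
    if d_km <= 0:
        raise ValueError("distance must be positive")
    return 128.1 + 37.6 * math.log10(d_km)


def gain_to_noise_ratio(d_km, noise_dbm):
    """Average channel power gain divided by the noise power in watts.

    Multiplying the result by a transmit power in watts gives the average
    received SNR (linear scale) before small scale fading is applied.
    """
    gain = 10.0 ** (-pathloss_db(d_km) / 10.0)
    noise_w = 10.0 ** ((noise_dbm - 30.0) / 10.0)
    return gain / noise_w


if __name__ == "__main__":
    # Assumed noise power of -110 dBm, distances as used in the simulations
    for d in (0.2, 1.0, 5.0):
        print(d, 10.0 * math.log10(gain_to_noise_ratio(d, -110.0)))
```

The steep 37.6 dB per decade slope explains why the balance between transmit power and circuit power shifts strongly with the distance.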

Fig. 3 compares the effect of $M$ on the EE for different $K$, where $N = 1$ and $d = 1$ km are considered. Different from the SE, which always increases linearly with $M$ when $M < N \times K$, the EE increases much more slowly or even decreases. For example, when $K = 8$, the EE with $M = 7$ is better than with $M = 8$. The reason is the practical dynamic power $M P_{\mathrm{dyn}}$: in this case, increasing $M$ causes a linear increase of both the capacity and the dynamic power. Moreover, when $M > K \times N$, increasing $M$ always decreases the EE, because the benefit of the transmit diversity gain brought by a larger $M$ is much smaller than the cost of the increased dynamic power. The user number $K$ and the transmit antenna number $M$ affect the EE in a complicated manner, so adjusting these parameters adaptively is important for improving the EE. This is distinct from spectral efficient systems, where a larger $M$ is always beneficial.

Fig. 3. The effect of antenna number $M$ on the EE, where $N = 1$, $d = 1$ km are considered.

Fig. 4. Effect of distance between BS and users on the EE, where $N = 1$, $K = 4$ are considered.

Fig. 4 compares the effect of $d$ on the EE for different $M$, where $N = 1$ and $K = 4$ are considered. It is observed that the optimal BS antenna number differs under different $d$. For example, $M = 4$ is optimal when $d = 0.2$ km and $M = 2$ performs less well, but the trend changes when $d = 5$ km, where $M = 2$ performs worst. The situation varies due to the tradeoff among capacity, transmit power, dynamic power and static power. For instance, when the distance is large, the transmit power takes up the main part of the total power consumption; thus $M = 2$, with the smallest number of BS antennas, consumes the highest transmit power and has the worst EE. How to study these parameters in a comprehensive manner is a challenge, which is left for future work.

TABLE III
Simulation Parameters

Parameter              Value
Bandwidth              5 MHz
Noise power            -110 dBm
Pathloss               128.1 + 37.6 log10(d) dB (d in kilometers)
$P_{\mathrm{dyn}}$     83 W
$P_{\mathrm{sta}}$     45.5 W
$\eta$                 0.38

VI. Conclusion 

Based on the uplink-downlink duality, the EE optimization of the MIMO BC can be transformed into a quasiconcave problem. Based on this feature, we propose an energy efficient iterative waterfilling scheme, built on the block-coordinate ascent algorithm, to maximize the EE of the MIMO BC. We prove the convergence of the proposed scheme and validate it through simulations. Finally, the effect of the system parameters is discussed.

Appendix A 

As in (14) the numerator is concave and differentiable, and the denominator is convex and differentiable, Theorem 1 can be directly obtained based on [20, Proposition 6].

Appendix B 

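The proof is motivated by [14, Appendix II].

We first prove that $[\mathbf{S}^*]_{jk} = 0$, $\forall j, k > L$. Consider $\mathbf{S} \succeq 0$ with $[\mathbf{S}]_{jk} \ne 0$ for some $j > L$ and $k > L$. Based on [14, Appendix II], we have $\sum_{k=L+1}^{M} [\mathbf{S}]_{kk} > 0$. Thus, we can define another diagonal matrix $\mathbf{S}' \succeq 0$ as

$$[\mathbf{S}']_{kk} = \begin{cases} [\mathbf{S}]_{kk} + \sum_{l=L+1}^{M} [\mathbf{S}]_{ll}, & k = 1, \\ [\mathbf{S}]_{kk}, & k = 2, \ldots, L, \\ 0, & k = L + 1, \ldots, M, \end{cases}$$

with $\mathrm{Tr}(\mathbf{S}') = \mathrm{Tr}(\mathbf{S})$ and $\log |\mathbf{I} + \mathbf{S}' \mathbf{D}_i| > \log |\mathbf{I} + \mathbf{S} \mathbf{D}_i|$. Hence, $G(\mathbf{S}', \lambda) > G(\mathbf{S}, \lambda)$. Therefore, $[\mathbf{S}^*]_{jk} = 0$, $\forall j, k > L$.

We then need to prove that $\mathbf{S}^*$ is diagonal. Consider any $\mathbf{S} \succeq 0$ with $[\mathbf{S}]_{jk} = 0$ for any $j > L$ and $k > L$ that is not diagonal. We can construct another diagonal matrix $\mathbf{S}'$ with $[\mathbf{S}']_{kk} = [\mathbf{S}]_{kk}$, $k = 1, \ldots, M$. Based on [14, Appendix II], $\mathrm{Tr}(\mathbf{S}') = \mathrm{Tr}(\mathbf{S})$ and $\log |\mathbf{I} + \mathbf{S}' \mathbf{D}_i| > \log |\mathbf{I} + \mathbf{S} \mathbf{D}_i|$.

Therefore, the optimal $\mathbf{S}^*$ is diagonal with $[\mathbf{S}^*]_{kk} \ge 0$ for $k = 1, \ldots, L$ and $[\mathbf{S}^*]_{kk} = 0$ for $k = L + 1, \ldots, M$.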

Appendix C 

As in (13) the numerator is concave and continuous, and the denominator is convex and continuous, $Y(\lambda)$ is strictly decreasing and continuous based on [20].

Looking at (23), we have that $Y(0) = \infty$ and $Y(\infty) = -\infty$. Therefore, there exists a unique $\lambda^*$ with $Y(\lambda^*) = 0$.

Furthermore, as shown in [20], in a concave fractional program, any local maximum is a global maximum. Therefore, the derived $\lambda^*$ is globally optimal.